A measurement of similarity to identify identical code clones
نویسندگان
چکیده
Code clones are described as a part of the program which is completely or partially similar to the other portions. In the earlier research the code clones have been detected using fingerprinting technique. The major challenge in our work was to group the code clones based on similarity measure. The proposed system measures the similarity based on similarity distance. The defined expression considers two parameters for calculating the similarity measure namely the similarity distance and the population of the clone. Thereby the code clones are clustered and ranked on the basis of their similarity measures. Indexing is used to interactively identify the clones which are caused due to inconsistent changes. As a result of this work all the identical clusters for most similar and more similar categories are identified.
منابع مشابه
Detecting and Measuring Similarity in Code Clones
Most previous work on code-clone detection has focused on finding identical clones, or clones that are identical up to identifiers and literal values. However, it is often important to find similar clones, too. One challenge is that the definition of similarity depends on the context in which clones are being found. Therefore, we propose new techniques for finding similar code blocks and for qu...
متن کاملREPRESENTATION, ANALYSIS, AND REFACTORING TECHNIQUES TO SUPPORT CODE CLONE MAINTENANCE by ROBERT AMBROSIUS TAIRAS BARRETT BRYANT, COMMITTEE CHAIR
The existence of code cloning, which represents identical or near identical sections of code, has been documented to occur frequently in software systems. The main purpose of cloning is the reuse of a code fragment that performs some functionality by copying and pasting it to a new location in the source code. Code clones embody a unique form of coupling, because their similarity inadvertently ...
متن کاملPredicting Risky Clones Based on Machine Learning
Code clones are similar or identical code fragments to one another in source code. It is said that code clones decrease maintainability of software. On the other hand, all the code clones are not necessarily harmful to software. In this study, we propose a method to identify risky code clones out of all the code clones in source code by using machine learning techniques. Our proposed method lea...
متن کاملDetecting Functional Similarity between Java Files Using Metrics
Two similar pieces of software code is called clones. Software Developers often copy a section of code, and then paste it with or without modification .Software clone detection is employed to lessen the software maintenance cost and to improve understandability of the system. It also helps in plagiarism detection. Many code clone detection techniques exist and they detect and identify various t...
متن کاملEmpirical Studies of Code Clone Genealogies
Two identical or similar code fragments form a clone pair. Previous studies have identified cloning as a risky practice. Therefore, a developer needs to be aware of any clone pairs so as to properly propagate any changes between clones. A clone pair experiences many changes during the creation and maintenance of software systems. A change can either maintain or remove the similarity between clo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Int. Arab J. Inf. Technol.
دوره 12 شماره
صفحات -
تاریخ انتشار 2015